Cocktail Party Problem: Source Separation Issues and Computational Methods

نویسندگان

  • Tariqullah Jan
  • Wenwu Wang
چکیده

The concept of the cocktail party problem (CPP) was coined by Cherry (1953). It was proposed to address the phenomenon associated with human auditory system that, in a cocktail party environment, humans have the ability to focus their listening attention on a single speaker when multiple conversations and background interferences and noise are presented simultaneously. Many researchers and scientists from a variety of research areas attempt to tackle this problem (Bregman, 1990; Arons, 1992; Yost, 1997; Feng et al., 2000; Bronkhorst, 2000). Despite of all these works, the CPP remains an open problem and demands further research effort. Figure 1 illustrates the cocktail party effect using a simplified scenario with two simultaneous conversations in the room environment. As the solution to the CPP offers many practical applications, engineers and scientists have spent their efforts in understanding the mechanism of the human auditory system, and hoping to design a machine which can work similarly to the human abStract

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Improved Cocktail - Party Processing

The human auditory system is able to focus on one speech signal and ignore other speech signals in an auditory scene where several conversations are taking place. This ability of the human auditory system is referred to as the “cocktail-party effect”. This property of human hearing is partly made possible by binaural listening. Interaural time differences (ITDs) and interaural level differences...

متن کامل

Ica Based Blind Source Separation of Sound Using Matlab

Blind source separation is the separation of a set of signal from a set of mixed signals, without the aid of information (or with very little information) about the source signals or the mixing process. Source separation problems in digital signal processing are those in which several signals have been mixed together and the objective is to find out what the original signals were. The classical...

متن کامل

Monaural Audio Speaker Separation Using Source-Contrastive Estimation

We propose an algorithm to separate simultaneously speaking persons from each other, the “cocktail party problem”, using a single microphone. Our approach involves a deep recurrent neural networks regression to a vector space that is descriptive of independent speakers. Such a vector space can embed empirically determined speaker characteristics and is optimized by distinguishing between speake...

متن کامل

Probabilistic Binary-Mask Cocktail-Party Source Separation in a Convolutional Deep Neural Network

Separation of competing speech is a key challenge in signal processing and a feat routinely performed by the human auditory brain. A long standing benchmark of the spectrogram approach to source separation is known as the ideal binary mask. Here, we train a convolutional deep neural network, on a twospeaker cocktail party problem, to make probabilistic predictions about binary masks. Our result...

متن کامل

Cocktail Party Solutions: Mixing techniques

The human auditory system has an unparalleled ability to solve what is known as the cocktail problem: the task of perceptually separating superimposed acoustic sources in a noisy environment. Information about the auditory system can inform our choices of representation and computation. Recent progress has been made from several diier-ent approaches to the acoustic source separation problem. Co...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016